Apparatuses and methods for parsing and comparing video resume duplications

ABSTRACT

Aspects relate to apparatuses and methods for parsing and comparing resume video duplications. An exemplary apparatus includes a memory communicatively connected to at least a processor and includes instructions configuring the at least a processor to acquire a plurality of video elements from an existing video resume, wherein the existing video resume includes at least an image component, recognize subject-specific data of the existing video resume as a function of the at least an image component, wherein the subject-specific data includes verbal content and non-verbal content, recognize at least a keyword of the existing video resume as a function of the subject-specific data, recognize at least a feature of the existing video resume as a function of the subject-specific data, compare the subject-specific data to a target video resume and determine, as a function of the comparison result, a duplication coefficient for the target resume video.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of Non-Provisional Application No.17/582,070 filed on Jan. 24, 2022, and entitled “APPARATUSES AND METHODSFOR PARSING AND COMPARING VIDEO RESUME DUPLICATIONS,” the entirety ofwhich is incorporated herein by reference.

FIELD OF THE INVENTION

The present invention relates to the field of selective visual displayapparatuses. In particular, the present invention is directed toapparatuses and methods for parsing and comparing resume videoduplications.

BACKGROUND

Video content (including short-form video content) has steadily beenrising in popularity for many years. Its advantages as a communicationmedium are manifest. However, present ways of doing business, which arecommonly reliant upon the written word, are not capable of makingoptimal use of this form of media.

SUMMARY OF THE DISCLOSURE

In an aspect, an apparatus for parsing and comparing resume videoduplications including a computing device configured to acquire aplurality of video elements from an existing video resume, wherein theexisting video resume includes at least an image component, recognizesubject-specific data of the existing video resume as a function of theat least an image component, wherein the subject-specific data includesverbal content and non-verbal content, recognize at least a keyword ofthe existing video resume as a function of the subject-specific data,recognize at least a feature of the existing video resume as a functionof the subject-specific data, compare the subject-specific data to atarget video resume and determine, as a function of the comparisonresult, a duplication coefficient for the target resume video.

In another aspect, a method of parsing and comparing resume videoduplications includes recognizing, using the at least a processor,subject-specific data of the existing video resume as a function of theat least an image component, wherein the subject-specific data includesverbal content and non-verbal content, recognizing, using the at least aprocessor, at least a keyword of the existing video resume as a functionof the subject-specific data, recognizing, using the at least aprocessor, at least a feature of the existing video resume as a functionof the subject-specific data, comparing, using the at least a processor,the subject-specific data to a target video resume and determining,using the at least a processor and as a function of the comparisonresult, a duplication coefficient for the target resume video.

These and other aspects and features of non-limiting embodiments of thepresent invention will become apparent to those skilled in the art uponreview of the following description of specific non-limiting embodimentsof the invention in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

For the purpose of illustrating the invention, the drawings show aspectsof one or more embodiments of the invention. However, it should beunderstood that the present invention is not limited to the precisearrangements and instrumentalities shown in the drawings, wherein:

FIG. 1 is a block diagram of an exemplary apparatus for parsing andcomparing video resume content to target video resume content;

FIG. 2 illustrates an exemplary image frame;

FIG. 3 is a table representing an exemplary association between multiplevideo resumes and a target video resume;

FIG. 4 illustrates an exemplary video resume database using a blockdiagram;

FIG. 5 illustrates by way of a block diagram an exemplary training datadatabase;

FIG. 6 is a block diagram of exemplary machine-learning processes;

FIG. 7 is a flow diagram of an exemplary method of parsing and comparingsolicitation video resume content to target video resume;

FIG. 8 illustrates an exemplary embodiment of a neural network;

FIG. 9 is a block diagram of an exemplary embodiment of a node of aneural network; and

FIG. 10 is a block diagram of a computing apparatus that can be used toimplement any one or more of the methodologies disclosed herein and anyone or more portions thereof.

The drawings are not necessarily to scale and may be illustrated byphantom lines, diagrammatic representations, and fragmentary views. Incertain instances, details that are not necessary for an understandingof the embodiments or that render other details difficult to perceivemay have been omitted.

DETAILED DESCRIPTION

At a high level, aspects of the present disclosure are directed toapparatus and methods for parsing and comparing resume videoduplications.

Aspects of the present disclosure can be used to automatically analyzeand sort applicants according to their resume video. Aspects of thepresent disclosure can also be used to add appeal to a job application.This is so, at least in part, because information request in sorting anapplicant is inherently present within a video resume. However, noapparatus presently facilitates the automatic sorting of applicant basedupon their video resumes.

Aspects of the present disclosure allow for practical improvement overcurrent state of art for applicant sorting by allowing for video data tobe parsed and compared to identify redundancies in stored information.Exemplary embodiments illustrating aspects of the present disclosure aredescribed below in the context of several specific examples.

Referring now to FIG. 1 , an exemplary embodiment of apparatus 100 forparsing and comparing resume video duplications is illustrated.Apparatus 100 includes computing device 104. Computing device 104 mayinclude any computing device as described in this disclosure, includingwithout limitation a microcontroller, microprocessor, digital signalprocessor (DSP) and/or system on a chip (SoC) as described in thisdisclosure. Computing device 104 may include, be included in, and/orcommunicate with a mobile device such as a mobile telephone orsmartphone. Computing device 104 may include a single computing deviceoperating independently, or may include two or more computing deviceoperating in concert, in parallel, sequentially or the like; two or morecomputing devices may be included together in a single computing deviceor in two or more computing devices. Computing device 104 may interfaceor communicate with one or more additional devices as described below infurther detail via a network interface device. Network interface devicemay be utilized for connecting computing device 104 to one or more of avariety of networks, and one or more devices. Examples of a networkinterface device include, but are not limited to, a network interfacecard (e.g., a mobile network interface card, a LAN card), a modem, andany combination thereof. Examples of a network include, but are notlimited to, a wide area network (e.g., the Internet, an enterprisenetwork), a local area network (e.g., a network associated with anoffice, a building, a campus or other relatively small geographicspace), a telephone network, a data network associated with atelephone/voice provider (e.g., a mobile communications provider dataand/or voice network), a direct connection between two computingdevices, and any combinations thereof. A network may employ a wiredand/or a wireless mode of communication. In general, any networktopology may be used. Information (e.g., data, software etc.) may becommunicated to and/or from a computer and/or a computing device.Computing device 104 may include but is not limited to, for example, acomputing device or cluster of computing devices in a first location anda second computing device or cluster of computing devices in a secondlocation. Computing device 104 may include one or more computing devicesdedicated to data storage, security, distribution of traffic for loadbalancing, and the like. Computing device 104 may distribute one or morecomputing tasks as described below across a plurality of computingdevices of computing device, which may operate in parallel, in series,redundantly, or in any other manner used for distribution of tasks ormemory between computing devices. Computing device 104 may beimplemented using a “shared nothing” architecture in which data iscached at the worker, in an embodiment, this may enable scalability ofapparatus 100 and/or computing device.

With continued reference to FIG. 1 , computing device 104 may bedesigned and/or configured to perform any method, method step, orsequence of method steps in any embodiment described in this disclosure,in any order and with any degree of repetition. For instance, computingdevice 104 may be configured to perform a single step or sequencerepeatedly until a desired or commanded outcome is achieved; repetitionof a step or a sequence of steps may be performed iteratively and/orrecursively using outputs of previous repetitions as inputs tosubsequent repetitions, aggregating inputs and/or outputs of repetitionsto produce an aggregate result, reduction or decrement of one or morevariables such as global variables, and/or division of a largerprocessing task into a set of iteratively addressed smaller processingtasks. Computing device 104 may perform any step or sequence of steps asdescribed in this disclosure in parallel, such as simultaneously and/orsubstantially simultaneously performing a step two or more times usingtwo or more parallel threads, processor cores, or the like; division oftasks between parallel threads and/or processes may be performedaccording to any protocol suitable for division of tasks betweeniterations. Persons skilled in the art, upon reviewing the entirety ofthis disclosure, will be aware of various ways in which steps, sequencesof steps, processing tasks, and/or data may be subdivided, shared, orotherwise dealt with using iteration, recursion, and/or parallelprocessing.

Still referring to FIG. 1 , computing device 104 is configured toacquire a plurality of video elements from a target video resume 108. Asused in this disclosure, “video elements” are diverse types of featuresfrom a video resume such as image features, frame features, soundfeatures, graphical features, and the like. As used in this disclosure,a “video resume” is a video in visual and/or audio form to provide arecording promoting a jobseeker. In some cases, video resume 112 mayinclude content that is representative or communicative of at leastattribute 124 of subject 120. As used in this disclosure, a “subject” isa person, for example a jobseeker. Subject 120 may be representeddirectly by video resume 112. For example, in some cases, imagecomponent 116 a may include an image of subject 120. As used in thisdisclosure, an “image component” may be a visual representation ofinformation, such as a plurality of temporally sequential frames and/orpictures, related to video resume and target video resume. For example,image component 116 a-b may include animations, still imagery, recordedvideo, and the like. Attributes 124 may include subject’s 120 skills,competencies, credentials, talents, and the like. In some cases,attributes 124 may be explicitly conveyed within video resume 112.Alternatively, or additionally, in some cases, attributes 124 may beconveyed implicitly with video resume 112.

As used in this disclosure, a “target video resume” includes at leastimage component 116 b that may be selected based on a request by a useror job-applicant from one or more databases in which multiple videoresumes 112 of users or job-applicants are stored. Target video resume108 may be the basis for computing device 104 to search for other videoresumes 112 based on target resume 108 video properties. Video elementsfrom target video 108 may relate to diverse types of features of targetvideo resume 108. As used in this disclosure, a “feature” is anindividually measurable property or characteristic such as imagefeature, frame feature, sound feature, graphical feature, and textualfeature. These features may be in the form of a floating-point numberfeature, a binarized feature, a fence feature, a recall feature, apooling feature, a reranking feature, and the like. Target video resume108 may be communicated by way of digital signals, for example betweencomputing devices which are communicatively connected with at least awireless network. Digital video may be compressed to optimize speedand/or cost of transmission of video. Videos may be compressed accordingto a video compression coding format (i.e., codec). Exemplary videocompression codecs include H.26× codecs, MPEG formats, VVC, SVT-AV1, andthe like. In some cases, compression of a digital video may be lossy, inwhich some information may be lost during compression. Alternatively, oradditionally, in some cases, compression of a digital video may besubstantially lossless, where substantially no information is lostduring compression.

Still referring to FIG. 1 , Video resume 112 may be representativesubject-specific data. As used in this disclosure, “subject-specificdata” is any element of information that is associated with a specificsubject. Exemplary forms of subj ect-specific data include imagecomponent 116 a-b, video resume 112, non-verbal content 132, verbalcontent 136, audio component, as well as any information deriveddirectly or indirectly from video resume 112 or any othersubject-specific data. For example, subject-specific data could be thephysical properties of subject 120, such as their body posture or facialexpression. Subject-specific data could also be audio sensory propertiesof subject 120, such as tone of voice or background audio in a resumevideo 112.

In some cases, video resume 112 may include non-verbal content 132. Asused in this disclosure, “non-verbal content” is all communication thatis not characterized as verbal content. As used in this disclosure,“verbal content” is comprehensible language-based communication. Forexample, verbal content may include “visual verbal content” which isliteral and/or written verbal content. Non-verbal content 132 includesall forms of communication which are not conveyed with use of language.Exemplary non-verbal content may include change in intonation and/orstress in a speaker’s voice, expression of emotion, and the like. Forexample, in some cases, non-verbal content may include visual non-verbalcontent. As used in this disclosure, “visual non-verbal content” isnon-verbal content 132 that is visually represented. In some cases,visual non-verbal content may be included within video resume 112 by wayof image component 116 a.

In some cases, a non-verbal classifier may classify non-verbal content132 present in one or more image component 116 a to one or more oftarget video resume 108, a feature. Non-verbal classifier may include anumber of classifiers, for example each being tasked with classifying aparticular attribute 124 or form of non-verbal content 132. For example,in some cases, non-verbal classifier may classify a video resume 112 andrelated subject 120 as associated with a feature representative of‘personable.’ Non-verbal classifier may include another specializedvisual non-verbal classifier to classify visual non-verbal content asappearing ‘personable’ that is, for example, as having appropriateposture, facial expressions, manner of dress, and the like. In somecases, classifier may include or a constituent part of tree structure,for making associations based upon video resume.

With continued reference to FIG. 1 , in some embodiments, imagecomponent 116 a-b may include or otherwise represent verbal content 136.For instance, written or visual verbal content may be included withinimage component 116 a-b. Visual verbal content may include images ofwritten text represented by image component 116 a-b. For example, visualverbal content may include, without limitation, digitally generatedgraphics, images of written text (e.g., typewritten, and the like),signage, and the like.

Still referring to FIG. 1 , in some embodiments, image component 116 a-bmay include or otherwise represent audible verbal content related to atleast an attribute 124 of subject 120. As used in this disclosure,“audible verbal content” is oral (e.g., spoken) verbal content. In somecases, audible verbal content may be included within video resume 112 byway of an audio component. As used in this disclosure, an “audiocomponent” is a representation of audio, for example a sound, a speech,and the like. In some cases, verbal content 136 may be related to atleast an attribute of subject. Additionally, or alternatively, visualverbal content and audible verbal content may be used as inputs toclassifiers as described throughout this disclosure.

In some cases, computing device 104 may include audiovisual speechrecognition (AVSR) processes to recognize verbal content 136 in videoresumes 112. For example, computing device 104 may use image content 116a-b to aid in recognition of audible verbal content such as viewingsubject 120 move their lips to speak on video to process the audiocontent of video resume 112. AVSR may use image component 116 a-b to aidthe overall translation of the audio verbal content of video resumes112. In some embodiments, AVSR may include techniques employing imageprocessing capabilities in lip reading to aid speech recognitionprocesses. In some cases, AVSR may be used to decode (i.e., recognize)indeterministic phonemes or help in forming a preponderance amongprobabilistic candidates. In some cases, AVSR may include an audio-basedautomatic speech recognition process and an image-based automatic speechrecognition process. AVSR may combine results from both processes withfeature fusion. Audio-based speech recognition process may analysisaudio according to any method described herein, for instance using a Melfrequency cepstrum coefficients (MFCCs) and/or log-Mel spectrogramderived from raw audio samples. Image-based speech recognition mayperform feature recognition to yield an image vector. In some cases,feature recognition may include any feature recognition processdescribed in this disclosure, for example a variant of a convolutionalneural network. In some cases, AVSR employs both an audio datum and animage datum to recognize verbal content 136. For instance, audio vectorand image vector may each be concatenated and used to predict speechmade by a subject 120, who is ‘on camera.’

In some cases, computing device 104 may be configured to recognize atleast a keyword 140 as a function of visual verbal content. In somecases, recognizing at least keyword 140 may include optical characterrecognition. As used in this disclosure, a “keyword” is an element ofword or syntax used to identify and/or match elements to each other. Insome cases, computing device 104 may transcribe much or evensubstantially all verbal content 136 from target resume video 108.

Still refereeing to FIG. 1 , in some embodiments, optical characterrecognition or optical character reader (OCR) includes automaticconversion of images of written (e.g., typed, handwritten or printedtext) into machine-encoded text. In some cases, recognition of at leasta keyword 140 from an image component 116 a-b may include one or moreprocesses, including without limitation optical character recognition(OCR), optical word recognition, intelligent character recognition,intelligent word recognition, and the like. In some cases, OCR mayrecognize written text, one glyph or character at a time. In some cases,optical word recognition may recognize written text, one word at a time,for example, for languages that use a space as a word divider. In somecases, intelligent character recognition (ICR) may recognize writtentext one glyph or character at a time, for instance by employingmachine-learning processes. In some cases, intelligent word recognition(IWR) may recognize written text, one word at a time, for instance byemploying machine-learning processes.

Still referring to FIG. 1 , in some cases OCR may be an “offline”process, which analyses a static document or image frame. In some cases,handwriting movement analysis can be used as input to handwritingrecognition. For example, instead of merely using shapes of glyphs andwords, this technique may capture motions, such as the order in whichsegments are drawn, the direction, and the pattern of putting the pendown and lifting it. This additional information may make handwritingrecognition more accurate. In some cases, this technology may bereferred to as “online” character recognition, dynamic characterrecognition, real-time character recognition, and intelligent characterrecognition.

Still referring to FIG. 1 , in some cases, OCR processes may employpre-processing of image component 116 a-b. Pre-processing process mayinclude without limitation de-skew, de-speckle, binarization, lineremoval, layout analysis or “zoning,” line and word detection, scriptrecognition, character isolation or “segmentation,” and normalization.In some cases, a de-skew process may include applying a transform (e.g.,homography or affine transform) to Image component 116 a-b to aligntext. In some cases, a de-speckle process may include removing positiveand negative spots and/or smoothing edges. In some cases, a binarizationprocess may include converting an image from color or greyscale toblack-and-white (i.e., a binary image). Binarization may be performed asa simple way of separating text (or any other desired image component)from a background of image component. In some cases, binarization may berequired for example if an employed OCR algorithm only works on binaryimages. In some cases, a line removal process may include removal ofnon-glyph or non-character imagery (e.g., boxes and lines). In somecases, a layout analysis or “zoning” process may identify columns,paragraphs, captions, and the like as distinct blocks. In some cases, aline and word detection process may establish a baseline for word andcharacter shapes and separate words, if necessary. In some cases, ascript recognition process may, for example in multilingual documents,identify script allowing an appropriate OCR algorithm to be selected. Insome cases, a character isolation or “segmentation” process may separatesignal characters, for example character-based OCR algorithms. In somecases, a normalization process may normalize aspect ratio and/or scaleof image component.

Still referring to FIG. 1 , in some embodiments an OCR process mayinclude an OCR algorithm. Exemplary OCR algorithms include matrixmatching process and/or feature extraction processes. Matrix matchingmay involve comparing an image to a stored glyph on a pixel-by-pixelbasis. In some case, matrix matching may also be known as “patternmatching,” “pattern recognition,” and/or “image correlation.” Matrixmatching may rely on an input glyph being correctly isolated from therest of the image component. Matrix matching may also rely on a storedglyph being in a similar font and at a same scale as input glyph. Matrixmatching may work best with typewritten text.

Still referring to FIG. 1 , in some embodiments, an OCR process mayinclude a feature extraction process. In some cases, feature extractionmay decompose a glyph into at least a feature 128. Exemplarynon-limiting features may include corners, edges, lines, closed loops,line direction, line intersections, and the like. In some cases, featureextraction may reduce dimensionality of representation and may make therecognition process computationally more efficient. In some cases,extracted feature 128 may be compared with an abstract vector-likerepresentation of a character, which might reduce to one or more glyphprototypes. General techniques of feature detection in computer visionare applicable to this type of OCR. In some embodiments,machine-learning processes like nearest neighbor classifiers (e.g.,k-nearest neighbors algorithm) may be used to compare image featureswith stored glyph features and choose a nearest match. OCR may employany machine-learning process described in this disclosure, for examplemachine-learning processes described with reference to FIG. 6 .Exemplary non-limiting OCR software includes Cuneiform and Tesseract.Cuneiform is a multi-language, open-source optical character recognitionsystem originally developed by Cognitive Technologies of Moscow, Russia.Tesseract is free OCR software originally developed by Hewlett-Packardof Palo Alto, California, United States.

Still referring to FIG. 1 , in some cases, OCR may employ a two-passapproach to character recognition. A first pass may try to recognize acharacter. Each character that is satisfactory is passed to an adaptiveclassifier as training data. The adaptive classifier then gets a chanceto recognize characters more accurately as it further analyzes imagecomponents. Since the adaptive classifier may have learned somethinguseful a little too late to recognize characters on the first pass, asecond pass is run over the image components.. Second pass may includeadaptive recognition and use characters recognized with high confidenceon the first pass to recognize better remaining characters on the secondpass. In some cases, two-pass approach may be advantageous for unusualfonts or low-quality image components where visual verbal content may bedistorted. Another exemplary OCR software tool include OCRopus. OCRopusdevelopment is led by German Research Centre for Artificial Intelligencein Kaiserslautern, Germany. In some cases, OCR software may employneural networks,

Still referring to FIG. 1 , in some cases, OCR may includepost-processing. For example, OCR accuracy may be increased, in somecases, if output is constrained by a lexicon. A lexicon may include alist or set of words that are allowed to occur in a document. In somecases, a lexicon may include, for instance, all the words in the Englishlanguage, or a more technical lexicon for a specific field. In somecases, an output stream may be a plain text stream or file ofcharacters. In some cases, an OCR process may preserve an originallayout of visual verbal content. In some cases, near-neighbor analysiscan make use of co-occurrence frequencies to correct errors, by notingthat certain words are often seen together. For example, “Washington,D.C.” is generally far more common in English than “Washington DOC.” Insome cases, an OCR process may make us of a priori knowledge of grammarfor a language being recognized. For example, grammar rules may be usedto help determine if a word is likely to be a verb or a noun. Distanceconceptualization may be employed for recognition and classification.For example, a Levenshtein distance algorithm may be used in OCRpost-processing to further optimize results.

Still referring to FIG. 1 , acquiring plurality of video elements mayinclude identifying a series of frames of target video resume 108,identifying a corresponding series of frames of at least an existingvideo resume 112, and comparing the series of frames of target videoresume 108 to existing video resume 112. An existing video resume isvideo resume 112 that has been uploaded to a video resume database. Theseries of frames may include a group of pictures having some degree ofinternal similarity, such as a group of pictures representing a scene.In some embodiments, comparing series of frames may include videocompression by inter-frame coding. The “inter” part of the term refersto the use of inter frame prediction. This kind of prediction tries totake advantage from temporal redundancy between neighboring framesenabling higher compression rates. Video data compression is the processof encoding information using fewer bits than the originalrepresentation. Any compression is either lossy or lossless. Losslesscompression reduces bits by identifying and eliminating statisticalredundancy. No information is lost in lossless compression. Lossycompression reduces bits by removing unnecessary or less importantinformation. Typically, a device that performs data compression isreferred to as an encoder, and one that performs the reversal of theprocess (decompression) as a decoder. Compression is useful because itreduces the resources required to store and transmit data. Computationalresources are consumed in the compression and decompression processes.Data compression is subject to a space-time complexity trade-off. Forinstance, a compression scheme for video may require expensive hardwarefor the video to be decompressed fast enough to be viewed as it is beingdecompressed, and the option to decompress the video in full beforewatching it may be inconvenient or require additional storage. Videodata may be represented as a series of still image frames. Such datausually contains abundant amounts of spatial and temporal redundancy.Video compression algorithms attempt to reduce redundancy and storeinformation more compactly.

Still referring to FIG. 1 , inter-frame coding works by comparing eachframe in the video with the previous one. Individual frames of a videosequence are compared from one frame to the next, and the videocompression codec sends only the differences to the reference frame. Ifthe frame contains areas where nothing has moved, the system can simplyissue a short command that copies that part of the previous frame intothe next one. If sections of the frame move in a simple manner, thecompressor can emit a (slightly longer) command that tells thedecompressor to shift, rotate, lighten, or darken the copy. Usually, theencoder will also transmit a residue signal which describes theremaining more subtle differences to the reference imagery. Usingentropy coding, these residue signals have a more compact representationthan the full signal. In areas of video with more motion, thecompression must encode more data to keep up with the larger number ofpixels that are changing. As used in this disclosure, reference framesare frames of a compressed video (a complete picture) that are used todefine future frames. As such, they are only used in inter-framecompression techniques. Some modern video encoding standards, such asH.264/AVC, allow the use of multiple reference frames. This allows thevideo encoder to choose among more than one previously decoded frame onwhich to base each macroblock in the next frame. While the best framefor this purpose is usually the previous frame, the extra referenceframes can improve compression efficiency and/or video quality. The twoframe types used in inter-fame coding is P-frames and B-frames. AP-frame (Predicted picture) holds only the changes in the image from theprevious frame. For example, in a scene where a car moves across astationary background, only the car’s movements need to be encoded. Theencoder does not need to store the unchanging background pixels in theP-frame, thus saving space. A B-frame (Bidirectional predicted picture)saves even more space by using differences between the current frame andboth the preceding and following frames to specify its content. An intercoded frame is divided into blocks known as macroblocks. A macroblock isa processing unit in image and video compression formats based on linearblock transforms, typically the discrete cosine transform (DCT). Amacroblock typically consists of 16×16 samples, and is furthersubdivided into transform blocks, and may be further subdivided intoprediction blocks. Formats which are based on macroblocks include JPEG,where they are called MCU blocks, H.261, MPEG-1 Part 2, H.262/MPEG-2Part 2, H.263, MPEG-4 Part 2, and H.264/MPEG-4 AVC. After the intercoded frame is divided into macroblocks, instead of directly encodingthe raw pixel values for each block, the encoder will try to find ablock similar to the one it is encoding on a previously encoded frame,referred to as a reference frame. This process is done by a blockmatching algorithm. If the encoder succeeds on its search, the blockcould be encoded by a vector, known as motion vector, which points tothe position of the matching block at the reference frame. The processof motion vector determination is called motion estimation. In mostcases the encoder will succeed, but the block found is likely not anexact match to the block it is encoding. This is why the encoder willcompute the differences between them. Those residual values are known asthe prediction error and need to be transformed and sent to the decoder.To sum up, if the encoder succeeds in finding a matching block on areference frame, it will obtain a motion vector pointing to the matchedblock and a prediction error. Using both elements, the decoder will beable to recover the raw pixels of the block. For example, video resumes112 and 108 may be compressed using a P-frame algorithm and broken downinto macroblocks. Individual still images taken from video resume 112can then be compared against a reference frame taken from target videoresume 108. A P-frame from video resume 112 would only hold the changesin image from target video resume 108. For example, if both videoresumes 112 and 108 include a subject 120 speaking in a similar manner,then what would be encoded and stored would be subtle changes such as aforeign object entering the background of video resume 112 compared tono foreign object in the background of the reference frame from targetvideo resume 108. Exemplary video compression codecs include withoutlimitation H.26× codecs, MPEG formats, VVC, SVT-AV1, and the like. Insome cases, compression may be lossy, in which some information may belost during compression. Alternatively, or additionally, in some cases,compression may be substantially lossless, where substantially noinformation is lost during compression. In some cases, image component116 a-b may include a plurality of temporally sequential frames. In somecases, each frame may be encoded (e.g., bitmap or vector-basedencoding). Each frame may be configured to be displayed by way of adisplay. Exemplary displays include without limitation light emittingdiode (LED) displays, cathode ray tube (CRT) displays, liquid crystaldisplays (LCDs), organic LEDs (OLDs), quantum dot displays, projectors(e.g., scanned light projectors), and the like.

With continued reference to FIG. 1 , computing device 104 is configuredto compare at least existing video resume 112 and target video resume108 to obtain a comparison result. An initial pass may be used bycomputing device 104 to sort elements of video resumes 112 intocategories, and a subsequent pass may involve detailed comparison ofcategory-matched video elements from at least two video resumes 112 toone another. For example, the initial pass may include classifying theplurality of existing video resumes 112 based on, image component 116 a,audio component, attributes 124, or at least identifying subject 120indica. For example, identifying indica could include name of user orsubject 120, account number, social security number, telephone number,address, and the like. In some embodiments, computing device 104 mayutilize a candidate classifier, which may include any classifier usedthroughout this disclosure, to run an initial pass over the videoelements of video resumes 112, break down and categorizes such elementsbefore comparing it to target video resume 108. A “classifier,” as usedin this disclosure is a machine-learning model, such as a mathematicalmodel, neural net, or program generated by a machine learning algorithmknown as a “classification algorithm that sorts inputs into categoriesor bins of data, outputting the categories or bins of data and/or labelsassociated therewith. A classifier may be configured to output at leasta datum that labels or otherwise identifies a set of data that areclustered together, found to be close under a distance metric, or thelike. As used in this disclosure, a “candidate classifier” is aclassifier that classifies subjects 120 to target video resume 112 or anoverall job description. In some cases, candidate classifier may includea trained machine-learning model, which is trained using candidatetraining data. As used in this disclosure, “candidate training data” isa training data that correlates one or more of subjects 120,subject-specific data, and subject attributes 124 to one or more jobdescriptions, description-specific data, and job description data. Asused in this disclosure, a “job description datum” is an element ofinformation associated with a job description. Target video resume 108may be representative of such job descriptive data. For example, in theinitial pass, video resumes 112 may be categorized based on subject’s120 attributes 124 such as credentials. As used in this disclosure,“credentials” are any piece of information that indicates anindividual’s qualification to perform a certain task or job. Allexisting video resumes 112 may be grouped based on level of experience,educational history, certifications, and the like.

Still referring to FIG. 1 , after the initial pass, during thesubsequent pass, two or more existing video resumes 112 may be comparedagainst one another and ranked based on similarity or relevance totarget video resume 108 before an overall comparison result of videoresumes 112 and target video 108 is computed. As used in thisdisclosure, “relevance” is a measure of closeness of association. Forexample, if candidate classifier grouped video resumes 112 togetherbased on years of experience, candidate classifier may then compare thevideo resumes in that group to output a relevance metric of videoresumes 112 that most align with target video 108. In some cases,relevance metric may be a quantified metric, for example in arbitraryunits or relative unit (e.g., percent).

Still referring to FIG. 1 , in some embodiments, after the initial andsubsequent pass are ran, computing device may utilize the data gatheredfrom the candidate classifier to calculate an overall comparison scoreof existing video resumes 112 to target video resume 108. In some cases,comparison result may contain a comparison score 148 that represents adegree of similarity between target video resume 108 and existing videoresume 112 of the plurality of existing video resumes. Comparison score148 may be determined by dynamic time warping (DTW) based on asimilarity matrix. Dynamic time warping may include algorithms formeasuring similarity between two sequences, which may vary in time orspeed. For instance, similarities in walking patterns may be detected,even if in one video the person was walking slowly and if in another heor she were walking more quickly, or even if there were accelerationsand deceleration during one observation. DTW has been applied to video,audio, and graphics - indeed, any data that can be turned into a linearrepresentation can be analyzed with DTW. In some cases, DTW may allowcomputing device 104 to find an optimal match between two givensequences (e.g., time series) with certain restrictions. That is, insome cases, sequences can be “warped” non-linearly to match each other.

With continued reference to FIG. 1 , in some embodiments, computingdevice 104 may extract or otherwise recognize at least feature 128 fromvideo resume 112. Feature 128 may be recognized and/or extracted fromimage component 116 a of existing video resumes 112. In some cases,features 128 may be recognized, which are associated with non-verbalcontent 132. For example, in some cases, visual non-verbal content suchas expression of subject’s 120 emotion may be represented by a number offeatures 128 which are readily extracted from image component 116 a ofexisting video resumes 112. In some cases, recognition and/or extractionof features 128 from image component 116 a may include use of machinevision techniques.

Still referring to FIG. 1 , in some embodiments, apparatus 100 mayinclude a machine vision process. A machine vision process may use imagecomponent 116 a from video resume 112, to make a determination aboutverbal 136 and/or non-verbal content 132. For example, in some cases amachine vision process may be used for world modeling or registration ofobjects within a space. In some cases, registration and/or featurerecognition may include image processing, such as without limitationobject recognition, feature 128 detection, edge/corner detection, andthe like. Non-limiting example of feature detection 128 may includescale invariant feature transform (SIFT), Canny edge detection, ShiTomasi corner detection, and the like. In some cases, a machine visionprocess may operate image classification and segmentation models, suchas without limitation by way of machine vision resource (e.g., OpenMV orTensorFlow Lite). A machine vision process may detect motion, forexample by way of frame differencing algorithms. A machine visionprocess may detect markers, for example blob detection, objectdetection, face detection, and the like. In some cases, a machine visionprocess may perform eye tracking (i.e., gaze estimation). In some cases,a machine vision process may perform person detection, for example byway of a trained machine learning model. In some cases, a machine visionprocess may perform motion detection (e.g., camera motion and/or objectmotion), for example by way of optical flow detection. In some cases,machine vision process may perform code (e.g., barcode) detection anddecoding. In some cases, a machine vision process may additionallyperform image capture and/or video recording.

Still referring to FIG. 1 , in some cases, machine vision process mayperform pose-estimation for example to ascertain a relative location ormovement of objects within existing video resumes to include one or moretransformations, for example to a view of a frame (or an image orexisting video resumes) relative a three-dimensional coordinate system;exemplary transformations include without limitation homographytransforms and affine transforms. In an embodiment, registration offirst frame to a coordinate system may be verified and/or correctedusing object identification and/or computer vision, as described above.For instance, and without limitation, an initial registration to twodimensions, represented for instance as registration to the x and ycoordinates, may be performed using a two-dimensional projection ofpoints in three dimensions onto a first frame, however. A thirddimension of registration, representing depth and/or a z axis, may bedetected by comparison of two frames; image recognition and/or edgedetection software may be used to detect multiple views of images of anobject (from subsequent frames) to derive a relative position along athird (z) axis. In some cases, solicitation video may include a stereoimage, having two stereoscopic views, which may be compared to derivez-axis values of points on object permitting, for instance, derivationof further z-axis points within and/or around the object usinginterpolation. Alternatively, or additionally, relative movement withinImage component 116(e.g., frame to frame) may be used to ascertainpositions of objects, even along a z-axis, for instance by way ofkinetic parallax. In some cases, relative motion of objects further awaymay occur at a different speed than objects nearby, this phenomenon maybe used to ascertain a position of objects relative a camera, forexample when the camera is moving. Object recognition and poseestimation may be repeated with multiple objects in field of view,including without a subject. In an embodiment, x and y axes may bechosen to span a plane common to a field of view of a camera used forsolicitation video image capturing and/or an xy plane of a first frame;a result, x and y translational components and ϕ may be pre-populated intranslational and rotational matrices, for affine transformation ofcoordinates of object, also as described above. Initial x and ycoordinates and/or guesses at transformational matrices mayalternatively or additionally be performed between first frame andsecond frame, as described above. For each point of a plurality ofpoints on object and/or edge and/or edges of object as described above,x and y coordinates of a first frame may be populated, with an initialestimate of z coordinates based, for instance, on assumptions aboutobject, such as an assumption that ground is substantially parallel toan xy plane as selected above. Z coordinates, and/or x, y, and zcoordinates, registered using image capturing and/or objectidentification processes as described above may then be compared tocoordinates predicted using initial guess at transformation matrices; anerror function may be computed using by comparing the two sets ofpoints, and new x, y, and/or z coordinates, may be iteratively estimatedand compared until the error function drops below a threshold level.

Still referring to FIG. 1 , in some cases, a machine vision process mayuse at least an image classifier, or any classifier described throughoutthis disclosure. As a non-limiting example, a machine vision process mayuse an image classifier, wherein the input is image component 116 a ofvideo resumes 112, and through a classification algorithm, outputs imagecomponents 116 a into categories based on training data, such assequential video resume 112 frames that match target video resume 108.Computing device 104 and/or another device may generate a classifierusing a classification algorithm, defined as a process whereby acomputing device derives a classifier from training data. Classificationmay be performed using, without limitation, linear classifiers such aswithout limitation logistic regression and/or I Bayes classifiers,nearest neighbor classifiers such as k-nearest neighbors classifiers,support vector machines, least squares support vector machines, fisher’slinear discriminant, quadratic classifiers, decision trees, boostedtrees, random forest classifiers, learning vector quantization, and/orneural network-based classifiers.

Still referring to FIG. 1 , computing device 104 may be configured togenerate a classifier using a Naïve Bayes classification algorithm.Naïve Bayes classification algorithm generates classifiers by assigningclass labels to problem instances, represented as vectors of elementvalues. Class labels are drawn from a finite set. Naïve Bayesclassification algorithm may include generating a family of algorithmsthat assume that the value of a particular element is independent of thevalue of any other element, given a class variable. Naïve Bayesclassification algorithm may be based on Bayes Theorem expressed asP(A/B)= P(B/A) P(A)÷P(B), where P(A/B) is the probability of hypothesisA given data B also known as posterior probability; P(B/A) is theprobability of data B given that the hypothesis A was true; P(A) is theprobability of hypothesis A being true regardless of data also known asprior probability of A; and P(B) is the probability of the dataregardless of the hypothesis. A naïve Bayes algorithm may be generatedby first transforming training data into a frequency table. Computingdevice 104 may then calculate a likelihood table by calculatingprobabilities of different data entries and classification labels.Computing device 104 may utilize a naive Bayes equation to calculate aposterior probability for each class. A class containing the highestposterior probability is the outcome of prediction. Naïve Bayesclassification algorithm may include a gaussian model that follows anormal distribution. Naïve Bayes classification algorithm may include amultinomial model that is used for discrete counts. Naïve Bayesclassification algorithm may include a Bernoulli model that may beutilized when vectors are binary.

With continued reference to FIG. 1 , computing device 104 may beconfigured to generate classifier 152 using a K-nearest neighbors (KNN)algorithm. A “K-nearest neighbors algorithm” as used in this disclosure,includes a classification method that utilizes feature similarity toanalyze how closely out-of-sample- features resemble training data toclassify input data to one or more clusters and/or categories offeatures as represented in training data; this may be performed byrepresenting both training data and input data in vector forms, andusing one or more measures of vector similarity to identifyclassifications within training data, and to determine a classificationof input data. K-nearest neighbors algorithm may include specifying aK-value, or a number directing the classifier to select the k mostsimilar entries training data to a given sample, determining the mostcommon classifier of the entries in the database, and classifying theknown sample; this may be performed recursively and/or iteratively togenerate a classifier that may be used to classify input data as furthersamples. For instance, an initial set of samples may be performed tocover an initial heuristic and/or “first guess” at an output and/orrelationship, which may be seeded, without limitation, using expertinput received according to any process as described herein. As anon-limiting example, an initial heuristic may include a ranking ofassociations between inputs and elements of training data. Heuristic mayinclude selecting some number of highest-ranking associations and/ortraining data elements.

With continued reference to FIG. 1 , generating k-nearest neighborsalgorithm may generate a first vector output containing a data entrycluster, generating a second vector output containing an input data, andcalculate the distance between the first vector output and the secondvector output using any suitable norm such as cosine similarity,Euclidean distance measurement, or the like. Each vector output may berepresented, without limitation, as an n-tuple of values, where n is atleast two values. Each value of n-tuple of values may represent ameasurement or other quantitative value associated with a given categoryof data, or attribute, examples of which are provided in further detailbelow; a vector may be represented, without limitation, in n-dimensionalspace using an axis per category of value represented in n-tuple ofvalues, such that a vector has a geometric direction characterizing therelative quantities of attributes in the n-tuple as compared to eachother. Two vectors may be considered equivalent where their directions,and/or the relative quantities of values within each vector as comparedto each other, are the same; thus, as a non-limiting example, a vectorrepresented as [5, 10, 15] may be treated as equivalent, for purposes ofthis disclosure, as a vector represented as [1, 2, 3]. Vectors may bemore similar where their directions are more similar, and more differentwhere their directions are more divergent; however, vector similaritymay alternatively or additionally be determined using averages ofsimilarities between like attributes, or any other measure of similaritysuitable for any n-tuple of values, or aggregation of numericalsimilarity measures for the purposes of loss functions as described infurther detail below. Any vectors as described herein may be scaled,such that each vector represents each attribute along an equivalentscale of values. Each vector may be “normalized,” or divided by a“length” attribute, such as a length attribute l as derived using aPythagorean norm:

$l = \sqrt{\sum_{i = 0}^{n}{a_{i}^{2},}}$

where a_(i) is attribute number I of the vector. Scaling and/ornormalization may function to make vector comparison independent ofabsolute quantities of attributes, while preserving any dependency onsimilarity of attributes; this may, for instance, be advantageous wherecases represented in training data are represented by differentquantities of samples, which may result in proportionally equivalentvectors with divergent values.

Still referring to FIG. 1 , in some cases, computing device associatesubject 120 with at least feature 128 from the target video resume 108by performing a text retrieval process as a function of at least keyword140. For example, in some cases, computing device 104 may query feature128 from target video resume 108 for presence of at least keyword 140.Querying feature 128 from target video resume 108 may include one ormore of word searching, phrase searching, proximity searching, full-textsearching, field (or metadata) searching, and the like. As used in thisdisclosure, a “target video feature keyword” is a word that is ofparticular relevance to an individual feature of target video resume.

With continued reference to FIG. 1 , in some embodiments, querying atleast image component 116 a-b may be performed with a text search, forexample using at least keyword 140 as a search term. Text search mayinclude techniques for searching a single computer-stored document or acollection of documents, for example in a database. Text search mayinclude full-text search. Full-text search may be distinguished fromsearches based on metadata or on field-based searching (e.g., fieldssuch as titles, abstracts, selected sections, or bibliographicalreferences). In an exemplary full-text search, computing device 104 mayexamine all words in every stored document as it tries to match searchcriteria (for example, keywords). Alternatively, a text search may belimited to fields, such as with field-based searching.

Still referring to FIG. 1 , in some embodiments, text searching mayinclude querying a database of video resumes in which multiple videoresumes 112 of subjects 120 are stored. As used in this disclosure,“video resume database” is a data structure configured to store dataassociated with a plurality of video resumes [...]. Database may beimplemented, without limitation, as a relational database, a key-valueretrieval database such as a NOSQL database, or any other format orstructure for use as a database that a person skilled in the art wouldrecognize as suitable upon review of the entirety of this disclosure.Database may alternatively or additionally be implemented using adistributed data storage protocol and/or data structure, such as adistributed hash table or the like. Database may include a plurality ofdata entries and/or records as described above. Data entries in adatabase may be flagged with or linked to one or more additionalelements of information, which may be reflected in data entry cellsand/or in linked tables such as tables related by one or more indices ina relational database. Persons skilled in the art, upon reviewing theentirety of this disclosure, will be aware of various ways in which dataentries in a database may store, retrieve, organize, and/or reflect dataand/or records as used herein, as well as categories and/or populationsof data consistently with this disclosure. In some cases, querying of atleast a video element may include any number of querying tools,including without limitation keywords (as described above),field-restricted search, Boolean queries, phrase search, concept search,concordance search, proximity search, regular expression, fuzzy search,wildcard search, and the like. In some cases, keywords may be used toperform a query. In some cases, a document (or trained indexers) maysupply a list of words that describe subject of the document, includingwithout limitation synonyms of words that describe the subject. In somecases, keywords may improve recall, for instance if the keyword listincludes a keyword that is not in text of a document. In some cases,querying tools may include field-restricted search. A field-restrictedsearch may allow a queries scope to be limited to within a particularfield within a stored data record, such as “Title” or “Author.” In somecases, a query tool may include Boolean queries. Searches that useBoolean operators (for example, “encyclopedia” AND “online” NOT“Encarta”) can dramatically increase precision of a search. In somecases, an AND operator may say, in effect, “Do not retrieve any documentunless it contains both of these terms.” In some cases, a NOT operatormay say, in effect, “Do not retrieve any document that contains thisword.” In some cases, a retrieval list retrieving too few documents, mayprompt and OR operator to be used in place of an AND operator toincrease recall; consider, for example, “encyclopedia” AND “online” OR“Internet” NOT “Encarta”. This search will retrieve documents aboutonline encyclopedias that use the term “Internet” instead of “online.”In some cases, search precision and recall are interdependent andnegatively correlated in text searching. In some cases, a query tool mayinclude phrase search. In some cases, a phrase search may match onlythose documents that contain a specified phrase. In some cases, a querytool may include a concept search. In some cases, a concept search maybe based on multi-word concepts, for example compound term processing.In some cases, a query tool may include a concordance search. In somecases, a concordance search may produce an alphabetical list of allprincipal words that occur in a text and may include their immediatecontext. In some cases, a query tool may include a proximity search. Insome cases, a proximity search matches only those documents that containtwo or more words that are separated by a specified number of words, arein the same sentence, or an in the same paragraph. A query tool mayinclude a regular expression. In some cases, a regular expression mayemploy a complex but powerful querying syntax that can be used tospecify retrieval conditions with precision, for instance databasesyntax. A query tool may include a fuzzy search. In some cases, a fuzzysearch may search for a document that matches given terms while allowingfor some variation around them. In some cases, a query tool may includea wildcard search. In some cases, a wildcard search may substitute oneor more characters in a search query for a wildcard character such as anasterisk. For example, using a wildcard, such as an asterisk, in asearch query “s*n” will search for terms inclusive of “sin,” “son,”“sun,” and the like.

Still referring to FIG. 1 , in some cases association of existing videoresume 112 with at least target video resume 108 may include one or moreof machine-learning process 144 and/or classifiers 152. Machine-learningprocess 144 may include any machine-learning process described in thisdisclosure. Classifier 152 may include any classifier described in thisdisclosure. In some cases, computing device 104 may associate existingvideo resume 112 with at least keyword 140 as a function of at leastfeature 128. As described above, at least feature 128 may representnon-verbal content 132 from existing video resume 112, at least feature128 may be correlated to at least target video resume 108, for exampleby way of one or more models (e.g., machine-learning models). In somecases, computing device 104 associate video resume 112 with at leasttarget video resume 108 by correlating, matching, or otherwise comparingsubject-specific data with description-specific data. In some cases, oneor both of subject-specific data and description-specific data may be atleast partially represented by features 128 which are inputs and/oroutputs of machine-learning processes. For example, a machine-learningmodel may be trained with deterministic or historical subject-specificdata correlated to description-specific data.

Still referring to FIG. 1 , as a function of the comparison result, aranking of the plurality of existing resume videos may be generated. Insome cases, generating the ranking of the plurality of existing resumesmay include linear regression techniques. Computing device 104 may bedesigned and configured to create a machine-learning model usingtechniques for development of linear regression models. Linearregression models may include ordinary least squares regression, whichaims to minimize the square of the difference between predicted outcomesand actual outcomes according to an appropriate norm for measuring sucha difference (e.g., a vector-space distance norm); coefficients of theresulting linear equation may be modified to improve minimization.Linear regression models may include ridge regression methods, where thefunction to be minimized includes the least-squares function plus termmultiplying the square of each coefficient by a scalar amount topenalize large coefficients. Linear regression models may include leastabsolute shrinkage and selection operator (LASSO) models, in which ridgeregression is combined with multiplying the least-squares term by afactor of 1 divided by double the number of samples. Linear regressionmodels may include a multi-task lasso model wherein the norm applied inthe least-squares term of the lasso model is the Frobenius normamounting to the square root of the sum of squares of all terms. Linearregression models may include the elastic net model, a multi-taskelastic net model, a least angle regression model, a LARS lasso model,an orthogonal matching pursuit model, a Bayesian regression model, alogistic regression model, a stochastic gradient descent model, aperceptron model, a passive aggressive algorithm, a robustnessregression model, a Huber regression model, or any other suitable modelthat may occur to persons skilled in the art upon reviewing the entiretyof this disclosure. Linear regression models may be generalized in anembodiment to polynomial regression models, whereby a polynomialequation (e.g., a quadratic, cubic or higher-order equation) providing abest predicted output/actual output fit is sought; similar methods tothose described above may be applied to minimize error functions, aswill be apparent to persons skilled in the art upon reviewing theentirety of this disclosure.

With continued reference to FIG. 1 , computing device 104 may beconfigured to determine, as a function of the comparison result, aduplication coefficient 156 for target resume video 108. As used in thisdisclosure, “duplication coefficient “is a quantitative value ofobserved similarities between two or more video resumes. Duplicationcoefficient 156 may be calculated or computed to provide a measure ormetric of similarity between target video resume 108 and at leastexisting resume video 112. Duplication coefficient 156 could stand for“how much”, “how many” or “how often” data appears in video resumes 112.The different categories of quantitative data could include,measurements, counts, calculations, sensors, projections, quantificationof qualitive data, and the like. For example, duplication coefficient156 may be the average of the observed subject-specific dataduplications of existing video resumes, divided by the total number ofexisting video resumes 112 in a database. Other examples, duplicationcoefficient could measure the number of subjects 120 appearing per videoresume 112. Duplication coefficient 156 could quantify how many subjects120 answered an interview question. Duplication coefficient could countsubjects 120 having similar technical backgrounds to target video resume108. Another example, duplication coefficient 156 could project subjects120 most qualified for a position based on their video resumes. In someembodiments, duplication coefficient 156 may include a metric range on ascale of 0, where 0 is least like target video resume 108, to 10, where10 is exact same as target video 108. It could also include a range ofpercentages and may cover any suitable range or rating score. In somecases, determining duplication coefficient 156 for target resume video108 may include linear regression analysis, machine-learning process,and the like. For example, duplication coefficient 156 may be calculatedby using classifier 152 configured to output at least a datum thatlabels or otherwise identifies a set of data that are clusteredtogether, found to be close under a distance metric. Additionally, oralternatively, duplication coefficient 156 may be an output of amachine-learning module 144 that generates classifier 152 using aclassification algorithm.

Still referring to FIG. 1 , computing device may utilize duplicationcoefficient 156 in determining which video resumes have value forstorage purposes. Duplication coefficient 156 may be used to measure thequantity of video resumes matching video elements of target video resume108. Duplication coefficient 156 may be used to select subject-specificdata among existing video resumes 112 and match the elements therein togive a range of similarity among all videos. For example, duplicationcoefficient 156 may be the metric range of similarity between existingvideo resumes 112 based on image component 116 a, audio component,non-verbal content 132, or verbal content 136. Computing device 104 maytake duplication coefficient of video resumes 112 image components 116 aand compare to image component 116 b of target video resume and give apercentage or other numeric value illustrating video resumes 112 imagecomponents that are most similar to target video resume 108 imagecomponent 116 b. Computing device 104 may utilize the duplicationcoefficient to disregard, delete, flag, video resumes 112 based on athreshold requirement set by the user or subject 120. For example, auser may set computing device 104 to disregard all video resumes withattributes 124 dissimilar to target video resume 108. A user could setcomputing 104 to delete video resumes 112 that that have an audiocompetent less than 50% similar to target video resume 108 audiocomponent. Computing device 104 may be set to flag video resumes 112that are missing and image or audio component in the video file.

Still referring to FIG. 1 , computing device 104 may utilize theduplication coefficient to output indications regarding video resume toa user, such as without limitations indications identifying flagged,matching, nonrelevant, and/or duplicative video resumes 112. Indicationsmay alternatively or additionally display a degree of similarity toanother video and/or video resume, may list one or more video resumesfound to be similar as a function of duplication coefficient, or thelike. In some cases, computing device 104 may utilize the duplicationcoefficient to automatically reject video resumes, or lower overallsubject’s 120 score, or other scores that measures subject’s 120desirability as an employee, for instance and without limitation basedon a threshold requirement of target video resume 108. For example,duplication coefficient 156 may be part of a scoring system thatgenerates a score by an algorithm based on user data records. Scoringsystem may include, without limitation, a career scoring systemproducing dynamic rating that represent, for instance in the form of anumerical score, a degree of employability, suitability for a job, andthe like. Additional disclosure related to the scoring system of subject120 may be found in U.S. Pat. Application No. 17/486,461 entitled“SYSTEMS AND METHODS FOR SCORE GENERATION FOR APPLICANT TRACKINGCROSS-REFERNCE TO RELATED APPLICATIONS” by W. Inman et al., which isincorporated in its entirety herein by reference.

Referring now to FIG. 2 , an exemplary frame of a video resume 200 isillustrated. Frame may include at least a portion of an image component.In some cases, a video resume may include many frames, for instance at aframe rate (e.g., 24, 30, 60, or 120 frames per second), therebyfacilitating video.

With continued reference to FIG. 2 , frame may include an image ofsubject. Alternatively, or additionally, in some cases video resume mayinclude images of other people or no person at all. An image of subjectmay be included in video resume.

Still referring to FIG. 2 , as more potential employees become moreliterate with video communication, video communication within a workenvironment may gain in popularity. Sharing one’s accomplishments andadvertising one’s services represents an important application ofcommunication for an individual jobseeker. As jobseekers gain in abilityand confidence in video communication, they will choose to put theirbest foot forward using the medium they believe best facilitatesadvantageous communication. As a result, some embodiments of the presentdisclosure inventively anticipate that video resumes will gain inpopularity and address an unmet need associated with video resumes.Presently, employers can screen many written resumes automatically.However, this technology cannot be used with video resumes. In somecase, embodiments described herein improve upon present resume screeningsystems by allowing the automatic parsing and correlating of videoresumes.

Referring now to FIG. 3 , a table 300 is depicted that illustrates anassociation between target video resume 304 and plurality of existingvideo resumes 308 a-d. Table 300 illustrates an image component oftarget video resume 304 being associated with plurality of existingvideo resumes 308 a-d.

With continued reference to FIG. 3 , table 300 may include a fewcolumns, for example a ranking column, video resume 308 a-d column, anda factor (e.g., comparison result 316) column. Table 300 may includemore than one factor column corresponding to other factors. For example,relevance, audio, text, video frame rate, and the like. Ranking 312 mayinclude a ranking of video resumes 308 a-d, for example from mostsimilar to target resume video resume 304, video resume 308 a, to aleast similar, video resume 308 d.

Referring now to FIG. 4 , an exemplary video resume database 400 isillustrated by way of a block diagram. Computing device 104 may becommunicatively connected with video resume database 400. For example,in some cases, video resume database 400 may be local to computingdevice 104. Alternatively, or additionally, in some cases, video resumedatabase 400 may be remote to computing device 104 and communicativewith computing device 104 by way of one or more networks. As used inthis disclosure, “video resume database” is a data structure configuredto store data associated with a plurality of video resumes 404. In somecases, data within video resume database 400 may be labeled to indicatewhich video resume 404 is associated with the data, for instance with aunique video resume identification number. Video resume database 400 maystore a plurality of video resumes 404. Video resumes 404 may beuploaded to video resume database 400 from at least a remote device.Remote device may include any computing device described in thisdisclosure. As described above with reference to FIG. 1 , video resumes404 may include image components 408, non-verbal contents 412, and/orverbal contents 416. Additionally, video resume database 400 may includevideo resume keywords 420. As described above, video resume keywords 420may be include words that were represented verbally within video resume404. Alternatively, or additionally, video resume keywords 420 may beassociated and/or classified to video resume 404 from non-verbal content412 or through user entry. Video resume database 400 may additionallystore video resume features 424 associated with video resumes 404. Asdescribed above, video resume features 424 may be recognized orextracted from video resumes 404 associated with a subject by way of anumber of processes described in detail in this disclosure.

Referring now to FIG. 5 , a training data database 500 is illustrated byway of a block diagram. Computing device 104 may be communicativelyconnected with video resume database 400. For example, in some cases,training data database 500 may be local to computing device 104.Alternatively, or additionally, in some cases, training data database500 may be remote to computing device 104 and communicative withcomputing device 104 by way of one or more networks. As used in thisdisclosure, “training data database” is a data structure configured tostore training data. As described in this disclosure, many embodimentsmay use training data to perform any number of processes. Training datamay, in some cases, need to be used, re-used, modified, deleted, and thelike. In some cases, training data may need to be conditionallyselected. Additionally, training data sets may be stored in trainingdata database 500. For example, in some cases, training data that may beused to extract data (e.g., keywords and/or features), for example fromvideo resumes may be stored in training data database 500; thesetraining sets may include without limitation visual verbal contenttraining data 504, visual non-verbal content training data 508.

Referring now to FIG. 6 , an exemplary embodiment of a machine-learningmodule 600 that may perform one or more machine-learning processes asdescribed in this disclosure is illustrated. Machine-learning module 600may perform determinations, classification, and/or analysis steps,methods, processes, or the like as described in this disclosure usingmachine learning processes. A “machine learning process,” as used inthis disclosure, is a process that automatedly uses training data togenerate an algorithm that will be performed by a computingdevice/module to produce outputs given data provided as inputs; this isin contrast to a non-machine learning software program where thecommands to be executed are determined in advance by a user and writtenin a programming language.

Still referring to FIG. 6 , “training data,” as used herein, is datacontaining correlations that a machine-learning process may use to modelrelationships between two or more categories of data elements. Forinstance, and without limitation, training data 604 may include aplurality of data entries, each entry representing a set of dataelements that were recorded, received, and/or generated together; dataelements may be correlated by shared existence in a given data entry, byproximity in a given data entry, or the like. Multiple data entries intraining data may evince one or more trends in correlations betweencategories of data elements; for instance, and without limitation, ahigher value of a first data element belonging to a first category ofdata element may tend to correlate to a higher value of a second dataelement belonging to a second category of data element, indicating apossible proportional or other mathematical relationship linking valuesbelonging to the two categories. Multiple categories of data elementsmay be related in training data according to various correlations;correlations may indicate causative and/or predictive links betweencategories of data elements, which may be modeled as relationships suchas mathematical relationships by machine-learning processes as describedin further detail below. Training data 604 may be formatted and/ororganized by categories of data elements, for instance by associatingdata elements with one or more descriptors corresponding to categoriesof data elements. As a non-limiting example, training data 604 mayinclude data entered in standardized forms by persons or processes, suchthat entry of a given data element in a given field in a form may bemapped to one or more descriptors of categories. Elements in trainingdata 604 may be linked to descriptors of categories by tags, tokens, orother data elements; for instance, and without limitation, training data604 may be provided in fixed-length formats, formats linking positionsof data to categories such as comma-separated value (CSV) formats and/orself-describing formats such as extensible markup language (XML),JavaScript Object Notation (JSON), or the like, enabling processes ordevices to detect categories of data.

Alternatively, or additionally, and continuing to refer to FIG. 6 ,training data 604 may include one or more elements that are notcategorized; that is, training data 604 may not be formatted or containdescriptors for some elements of data. Machine-learning algorithmsand/or other processes may sort training data 604 according to one ormore categorizations using, for instance, natural language processingalgorithms, tokenization, detection of correlated values in raw data andthe like; categories may be generated using correlation and/or otherprocessing algorithms. As a non-limiting example, in a corpus of text,phrases making up a number “n” of compound words, such as nouns modifiedby other nouns, may be identified according to a statisticallysignificant prevalence of n-grams containing such words in a particularorder; such an n-gram may be categorized as an element of language suchas a “word” to be tracked similarly to single words, generating a newcategory as a result of statistical analysis. Similarly, in a data entryincluding some textual data, a person’s name may be identified byreference to a list, dictionary, or other compendium of terms,permitting ad-hoc categorization by machine-learning algorithms, and/orautomated association of data in the data entry with descriptors or intoa given format. The ability to categorize data entries automatedly mayenable the same training data 604 to be made applicable for two or moredistinct machine-learning algorithms as described in further detailbelow. Training data 604 used by machine-learning module 600 maycorrelate any input 612 data as described in this disclosure to anyoutput 608 data as described in this disclosure. As a non-limitingillustrative input may include subject-specific data and outputs mayinclude description-specific data.

Further referring to FIG. 6 , training data 604 may be filtered, sorted,and/or selected using one or more supervised and/or unsupervisedmachine-learning processes 632 and/or models as described in furtherdetail below; such models may include without limitation a training dataclassifier 616. Training data classifier 616 may include a “classifier,”which as used in this disclosure is a machine-learning model 624 asdefined below, such as a mathematical model, neural net, or programgenerated by a machine learning algorithm known as a “classificationalgorithm,” as described in further detail below, that sorts inputs intocategories or bins of data, outputting the categories or bins of dataand/or labels associated therewith. A classifier may be configured tooutput 608 at least a datum that labels or otherwise identifies a set ofdata that are clustered together, found to be close under a distancemetric as described below, or the like. Machine-learning module 600 maygenerate a classifier using a classification algorithm, defined as aprocess whereby a computing device and/or any module and/or componentoperating thereon derives a classifier from training data.Classification may be performed using, without limitation, linearclassifiers such as without limitation logistic regression and/or naiveBayes classifiers, nearest neighbor classifiers such as k-nearestneighbors classifiers, support vector machines, least squares supportvector machines, fisher’s linear discriminant, quadratic classifiers,decision trees, boosted trees, random forest classifiers, learningvector quantization, and/or neural network-based classifiers. As anon-limiting example, training data classifier 616 may classify elementsof training data 604 to according to fields of target video resume forinstance, title, role, organization, requisite experience, requisitecredentials, and the like.

Still referring to FIG. 6 , machine-learning module 600 may beconfigured to perform a lazy-learning process and/or protocol, which mayalternatively be referred to as a “lazy loading” or “call-when-needed”process and/or protocol, may be a process whereby machine learning isconducted upon receipt of an input 612 to be converted to an output 608,by combining the input 612 and training set to derive the algorithm tobe used to produce the output 608 on demand. For instance, an initialset of simulations may be performed to cover an initial heuristic and/or“first guess” at an output and/or relationship. As a non-limitingexample, an initial heuristic may include a ranking of associationsbetween inputs and elements of training data. Heuristic may includeselecting some number of highest-ranking associations and/or trainingdata elements. Lazy learning 620 may implement any suitable lazylearning algorithm, including without limitation a K-nearest neighborsalgorithm, a lazy naive Bayes algorithm, or the like; persons skilled inthe art, upon reviewing the entirety of this disclosure, will be awareof various lazy-learning algorithms that may be applied to generateoutputs as described in this disclosure, including without limitationlazy learning applications of machine-learning algorithms as describedin further detail below.

Alternatively, or additionally, and with continued reference to FIG. 6 ,machine-learning processes as described in this disclosure may be usedto generate machine-learning models. A “machine-learning model,” as usedin this disclosure, is a mathematical and/or algorithmic representationof a relationship between inputs and outputs, as generated using anymachine-learning process including without limitation any process asdescribed above and stored in memory; an input 612 is submitted to amachine-learning model 624 once created, which generates an output 608based on the relationship that was derived. For instance, and withoutlimitation, a linear regression model, generated using a linearregression algorithm, may compute a linear combination of input 612 datausing coefficients derived during machine-learning processes tocalculate an output datum. As a further non-limiting example, amachine-learning model may be generated by creating an artificial neuralnetwork, such as a convolutional neural network comprising an input 612layer of nodes, one or more intermediate layers, and an output 608 layerof nodes. Connections between nodes may be created via the process of“training” the network, in which elements from a training data 804 setare applied to the input 612 nodes, a suitable training algorithm (suchas Levenberg-Marquardt, conjugate gradient, simulated annealing, orother algorithms) is then used to adjust the connections and weightsbetween nodes in adjacent layers of the neural network to produce thedesired values at the output 608 nodes. This process is sometimesreferred to as deep learning.

Still referring to FIG. 6 , machine-learning algorithms may include atleast a supervised machine-learning process 628. At least a supervisedmachine-learning process 628, as defined herein, include algorithms thatreceive a training set relating a number of inputs to a number ofoutputs, and seek to find one or more mathematical relations relatinginputs to outputs, where each of the one or more mathematical relationsis optimal according to some criterion specified to the algorithm usingsome scoring function. For instance, a supervised learning algorithm mayinclude subject-specific data as described above as inputs,description-specific data as outputs, and a scoring functionrepresenting a desired form of relationship to be detected betweeninputs and outputs; scoring function may, for instance, seek to maximizethe probability that a given input and/or combination of elements inputsis associated with a given output to minimize the probability that agiven input is not associated with a given output. Scoring function maybe expressed as a risk function representing an “expected loss” of analgorithm relating inputs to outputs, where loss is computed as an errorfunction representing a degree to which a prediction generated by therelation is incorrect when compared to a given input-output pairprovided in training data. Persons skilled in the art, upon reviewingthe entirety of this disclosure, will be aware of various possiblevariations of at least a supervised machine-learning process 628 thatmay be used to determine relation between inputs and outputs. Supervisedmachine-learning processes may include classification algorithms asdefined above.

Further referring to FIG. 6 , machine learning processes may include atleast an unsupervised machine-learning processes 632. An unsupervisedmachine-learning process 632, as used herein, is a process that derivesinferences in datasets without regard to labels; as a result, anunsupervised machine-learning process 632 may be free to discover anystructure, relationship, and/or correlation provided in the data.Unsupervised processes may not require a response variable; unsupervisedprocesses may be used to find interesting patterns and/or inferencesbetween variables, to determine a degree of correlation between two ormore variables, or the like.

Still referring to FIG. 6 , machine-learning module 600 may be designedand configured to create a machine-learning model 624 using techniquesfor development of linear regression models. Linear regression modelsmay include ordinary least squares regression, which aims to minimizethe square of the difference between predicted outcomes and actualoutcomes according to an appropriate norm for measuring such adifference (e.g., a vector-space distance norm); coefficients of theresulting linear equation may be modified to improve minimization.Linear regression models may include ridge regression methods, where thefunction to be minimized includes the least-squares function plus termmultiplying the square of each coefficient by a scalar amount topenalize large coefficients. Linear regression models may include leastabsolute shrinkage and selection operator (LASSO) models, in which ridgeregression is combined with multiplying the least-squares term by afactor of 1 divided by double the number of samples. Linear regressionmodels may include a multi-task lasso model wherein the norm applied inthe least-squares term of the lasso model is the Frobenius normamounting to the square root of the sum of squares of all terms. Linearregression models may include the elastic net model, a multi-taskelastic net model, a least angle regression model, a LARS lasso model,an orthogonal matching pursuit model, a Bayesian regression model, alogistic regression model, a stochastic gradient descent model, aperceptron model, a passive aggressive algorithm, a robustnessregression model, a Huber regression model, or any other suitable modelthat may occur to persons skilled in the art upon reviewing the entiretyof this disclosure. Linear regression models may be generalized in anembodiment to polynomial regression models, whereby a polynomialequation (e.g., a quadratic, cubic or higher-order equation) providing abest predicted output/actual output fit is sought; similar methods tothose described above may be applied to minimize error functions, aswill be apparent to persons skilled in the art upon reviewing theentirety of this disclosure.

Continuing to refer to FIG. 6 , machine-learning algorithms may include,without limitation, linear discriminant analysis. Machine-learningalgorithm may include quadratic discriminate analysis. Machine-learningalgorithms may include kernel ridge regression. Machine-learningalgorithms may include support vector machines, including withoutlimitation support vector classification-based regression processes.Machine-learning algorithms may include stochastic gradient descentalgorithms, including classification and regression algorithms based onstochastic gradient descent. Machine-learning algorithms may includenearest neighbors algorithms. Machine-learning algorithms may includevarious forms of latent space regularization such as variationalregularization. Machine-learning algorithms may include Gaussianprocesses such as Gaussian Process Regression. Machine-learningalgorithms may include cross-decomposition algorithms, including partialleast squares and/or canonical correlation analysis. Machine-learningalgorithms may include naïve Bayes methods. Machine-learning algorithmsmay include algorithms based on decision trees, such as decision treeclassification or regression algorithms. Machine-learning algorithms mayinclude ensemble methods such as bagging meta-estimator, forest ofrandomized tress, AdaBoost, gradient tree boosting, and/or votingclassifier methods. Machine-learning algorithms may include neural netalgorithms, including convolutional neural net processes.

Referring now to FIG. 7 , an exemplary method of parsing and comparingresume video duplications is illustrated by way of a flow diagram. Atstep 705, method may include acquiring, using a computing device, aplurality of video elements from a target video resume. This may beaccomplished without limitation as described above in reference to FIGS.1-6 , wherein target video includes at least an image component. In somecases, image component may include non-verbal or verbal content, forexample with reference to FIG. 1 . Computing device may include anycomputing device as described in this disclosure, for example withreference to FIGS. 1 and 4-6 . Target video resume may include anytarget video resume described in this disclosure, for example withreference to FIGS. 1-6 . Subject may include any subject described inthis disclosure, for example with reference to FIG. 1 .

With continued reference to FIG. 7 , at step 710, method may includecomparing, using a computing device, at least an existing video resumeand the target video resume video resume for similarities. Existingvideo resume may be any video resume as described above in reference toFIGS. 1 and 4 . In some cases, computing device may obtain a comparisonresult of video resume and target video resume. The comparison resultmay contain a comparison score that represents a degree of similaritybetween target video resume and an existing video resume of theplurality of existing video resumes, as described with reference toFIGS. 1 and 3 . In some cases, the comparison score may be determined bydynamic time warping based on a similarity matrix, as disclosed in FIG.1 . As a function of the comparison result, a ranking of the pluralityof existing resume videos is may be generated. In some cases, generatingthe ranking of the plurality of existing resumes may include linearregression techniques, as disclosed in FIG. 1 .

With continued reference to FIG. 7 , at step 715 method may includedetermining, using computing device, as a function of the comparisonresult, a duplication coefficient for target resume video, wherein theduplication coefficient relates to a similarity between target videoresume and at least an existing resume video, as dislcosed in FIG. 1 .Duplication coefficient may represent any quantitative value asdisclosed in FIG. 1 and may be calculated through a computing device ormachine-learning process as disclosed in FIG. 1 .

With continued reference to FIG. 7 , in some embodiments, method 700 mayinclude associating, using computing device, video resume with targetvideo resume as a function of at least a keyword, as described withreference to FIG. 1 . In some embodiments, method may additionallyinclude classifying, using computing device and at least a candidateclassifier, video resume to the target video resume. Candidateclassifier may include any classifier described in this disclosure, forexample including with reference to FIG. 1 . In some embodiments, method700 may additionally include querying, using computing device, a videoresume with at least a keyword. In some embodiments, method 700 mayadditionally include determining relevance as a function of associationbetween video resume and target video resume, for example including withreference to FIGS. 1 and 3 .

Still referring to FIG. 7 , in some embodiments, at least an imagecomponent may include visual non-verbal content related to at least anattribute of subject. Visual non-verbal content may include any visualnon-verbal content described in this disclosure, for example includingwith reference to FIG. 1 . Attribute may include any attribute describedin this disclosure, for example including with reference to FIG. 1 . Insome cases, method 700 may additionally include recognizing, usingcomputing device, at least a feature as a function of visual non-verbalcontent and associating, using the computing device, subject with targetvideo resume as a function of the at least a feature. Feature mayinclude any feature described in this disclosure, for example includingwith reference to FIG. 1 . In some cases, recognizing at least a featureand associating video resume with target video resume as a function ofthe at least a feature may use a machine-learning process.Machine-learning process may include any machine learning processdescribed in this disclosure, for example including with reference toFIGS. 1 and 5-6 .

Still referring to FIG. 7 , in some embodiments at least an imagecomponent may include visual verbal content. In some cases, method 700may additionally include recognizing, using computing device, at least akeyword as a function of visual verbal content and associating, usingthe computing device, video resume with target video resume as afunction of the at least a keyword. Visual verbal content may includeany visual verbal content described in this disclosure, for exampleincluding with reference to FIG. 1 .

Still referring to FIG. 7 , in some embodiments at least an audiocomponent may include audible non-verbal content related to at least anattribute of subject. Audible non-verbal content may include any audiblenon-verbal content described in this disclosure, for example includingwith reference to FIG. 1 . In some cases, method 700 may additionallyinclude recognizing, using computing device, at least a feature as afunction of audible non-verbal content and associating, using thecomputing device, video resume with target video resume as a function ofthe at least a feature.

Referring now to FIG. 8 , an exemplary embodiment of neural network 800is illustrated. A neural network 800 also known as an artificial neuralnetwork, is a network of “nodes,” or data structures having one or moreinputs, one or more outputs, and a function determining outputs based oninputs. Such nodes may be organized in a network, such as withoutlimitation a convolutional neural network, including an input layer ofnodes, one or more intermediate layers, and an output layer of nodes.Connections between nodes may be created via the process of “training”the network, in which elements from a training dataset are applied tothe input nodes, a suitable training algorithm (such asLevenberg-Marquardt, conjugate gradient, simulated annealing, or otheralgorithms) is then used to adjust the connections and weights betweennodes in adjacent layers of the neural network to produce the desiredvalues at the output nodes. This process is sometimes referred to asdeep learning. Connections may run solely from input nodes toward outputnodes in a “feed-forward” network or may feed outputs of one layer backto inputs of the same or a different layer in a “recurrent network.”

Referring now to FIG. 9 , an exemplary embodiment of a node of a neuralnetwork is illustrated. A node may include, without limitation aplurality of inputs xi that may receive numerical values from inputs toa neural network containing the node and/or from other nodes. Node mayperform a weighted sum of inputs using weights wi that are multiplied byrespective inputs xi. Additionally, or alternatively, a bias b may beadded to the weighted sum of the inputs such that an offset is added toeach unit in the neural network layer that is independent of the inputto the layer. The weighted sum may then be input into a function φ,which may generate one or more outputs y. Weight wi applied to an inputxi may indicate whether the input is “excitatory,” indicating that ithas strong influence on the one or more outputs y, for instance by thecorresponding weight having a large numerical value, and/or a“inhibitory,” indicating it has a weak effect influence on the one moreinputs y, for instance by the corresponding weight having a smallnumerical value. The values of weights wi may be determined by traininga neural network using training data, which may be performed using anysuitable process as described above.

It is to be noted that any one or more of the aspects and embodimentsdescribed herein may be conveniently implemented using one or moremachines (e.g., one or more computing devices that are utilized as auser computing device for an electronic document, one or more serverdevices, such as a document server, etc.) programmed according to theteachings of the present specification, as will be apparent to those ofordinary skill in the computer art. Appropriate software coding canreadily be prepared by skilled programmers based on the teachings of thepresent disclosure, as will be apparent to those of ordinary skill inthe software art. Aspects and implementations discussed above employingsoftware and/or software modules may also include appropriate hardwarefor assisting in the implementation of the machine executableinstructions of the software and/or software module.

Such software may be a computer program product that employs amachine-readable storage medium. A machine-readable storage medium maybe any medium that is capable of storing and/or encoding a sequence ofinstructions for execution by a machine (e.g., a computing device) andthat causes the machine to perform any one of the methodologies and/orembodiments described herein. Examples of a machine-readable storagemedium include, but are not limited to, a magnetic disk, an optical disc(e.g., CD, CD-R, DVD, DVD-R, etc.), a magneto-optical disk, a read-onlymemory “ROM” device, a random access memory “RAM” device, a magneticcard, an optical card, a solid-state memory device, an EPROM, an EEPROM,and any combinations thereof. A machine-readable medium, as used herein,is intended to include a single medium as well as a collection ofphysically separate media, such as, for example, a collection of compactdiscs or one or more hard disk drives in combination with a computermemory. As used herein, a machine-readable storage medium does notinclude transitory forms of signal transmission.

Such software may also include information (e.g., data) carried as adata signal on a data carrier, such as a carrier wave. For example,machine-executable information may be included as a data-carrying signalembodied in a data carrier in which the signal encodes a sequence ofinstruction, or portion thereof, for execution by a machine (e.g., acomputing device) and any related information (e.g., data structures anddata) that causes the machine to perform any one of the methodologiesand/or embodiments described herein.

Examples of a computing device include, but are not limited to, anelectronic book reading device, a computer workstation, a terminalcomputer, a server computer, a handheld device (e.g., a tablet computer,a smartphone, etc.), a web appliance, a network router, a networkswitch, a network bridge, any machine capable of executing a sequence ofinstructions that specify an action to be taken by that machine, and anycombinations thereof. In one example, a computing device may includeand/or be included in a kiosk.

FIG. 10 shows a diagrammatic representation of one embodiment of acomputing device in the exemplary form of a computer apparatus 1000within which a set of instructions for causing a control system toperform any one or more of the aspects and/or methodologies of thepresent disclosure may be executed. It is also contemplated thatmultiple computing devices may be utilized to implement a speciallyconfigured set of instructions for causing one or more of the devices toperform any one or more of the aspects and/or methodologies of thepresent disclosure. Computer apparatus 1000 includes a processor 1004and a memory 1008 that communicate with each other, and with othercomponents, via a bus 1012. Bus 1012 may include any of several types ofbus structures including, but not limited to, a memory bus, a memorycontroller, a peripheral bus, a local bus, and any combinations thereof,using any of a variety of bus architectures.

Processor 1004 may include any suitable processor, such as withoutlimitation a processor incorporating logical circuitry for performingarithmetic and logical operations, such as an arithmetic and logic unit(ALU), which may be regulated with a state machine and directed byoperational inputs from memory and/or sensors; processor 1004 may beorganized according to Von Neumann and/or Harvard architecture as anon-limiting example. Processor 1004 may include, incorporate, and/or beincorporated in, without limitation, a microcontroller, microprocessor,digital signal processor (DSP), Field Programmable Gate Array (FPGA),Complex Programmable Logic Device (CPLD), Graphical Processing Unit(GPU), general purpose GPU, Tensor Processing Unit (TPU), analog ormixed signal processor, Trusted Platform Module (TPM), a floating pointunit (FPU), and/or system on a chip (SoC).

Memory 1008 may include various components (e.g., machine-readablemedia) including, but not limited to, a random-access memory component,a read only component, and any combinations thereof. In one example, abasic input/output system 1016 (BIOS), including basic routines thathelp to transfer information between elements within computer apparatus1000, such as during start-up, may be stored in memory 1008. Memory 1008may also include (e.g., stored on one or more machine-readable media)instructions (e.g., software) 1020 embodying any one or more of theaspects and/or methodologies of the present disclosure. In anotherexample, memory 1008 may further include any number of program modulesincluding, but not limited to, an operating system, one or moreapplication programs, other program modules, program data, and anycombinations thereof.

Computer apparatus 1000 may also include a storage device 1024. Examplesof a storage device (e.g., storage device 1024) include, but are notlimited to, a hard disk drive, a magnetic disk drive, an optical discdrive in combination with an optical medium, a solid-state memorydevice, and any combinations thereof. Storage device 1024 may beconnected to bus 1012 by an appropriate interface (not shown). Exampleinterfaces include, but are not limited to, SCSI, advanced technologyattachment (ATA), serial ATA, universal serial bus (USB), IEEE 1394(FIREWIRE), and any combinations thereof. In one example, storage device1024 (or one or more components thereof) may be removably interfacedwith computer apparatus 1000 (e.g., via an external port connector (notshown)). Particularly, storage device 1024 and an associatedmachine-readable medium 1028 may provide nonvolatile and/or volatilestorage of machine-readable instructions, data structures, programmodules, and/or other data for computer apparatus 1000. In one example,software 1020 may reside, completely or partially, withinmachine-readable medium 1028. In another example, software 1020 mayreside, completely or partially, within processor 1004.

Computer apparatus 1000 may also include an input device 1032. In oneexample, a user of computer apparatus 1000 may enter commands and/orother information into computer apparatus 1000 via input device 1032.Examples of an input device 1032 include, but are not limited to, analpha-numeric input device (e.g., a keyboard), a pointing device, ajoystick, a gamepad, an audio input device (e.g., a microphone, a voiceresponse system, etc.), a cursor control device (e.g., a mouse), atouchpad, an optical scanner, a video capture device (e.g., a stillcamera, a video camera), a touchscreen, and any combinations thereof.Input device 1032 may be interfaced to bus 1012 via any of a variety ofinterfaces (not shown) including, but not limited to, a serialinterface, a parallel interface, a game port, a USB interface, aFIREWIRE interface, a direct interface to bus 1012, and any combinationsthereof. Input device 1032 may include a touch screen interface that maybe a part of or separate from display 1036, discussed further below.Input device 1032 may be utilized as a user selection device forselecting one or more graphical representations in a graphical interfaceas described above.

A user may also input commands and/or other information to computerapparatus 1000 via storage device 1024 (e.g., a removable disk drive, aflash drive, etc.) and/or network interface device 1040. A networkinterface device, such as network interface device 1040, may be utilizedfor connecting computer apparatus 1000 to one or more of a variety ofnetworks, such as network 1044, and one or more remote devices 1048connected thereto. Examples of a network interface device include, butare not limited to, a network interface card (e.g., a mobile networkinterface card, a LAN card), a modem, and any combination thereof.Examples of a network include, but are not limited to, a wide areanetwork (e.g., the Internet, an enterprise network), a local areanetwork (e.g., a network associated with an office, a building, a campusor other relatively small geographic space), a telephone network, a datanetwork associated with a telephone/voice provider (e.g., a mobilecommunications provider data and/or voice network), a direct connectionbetween two computing devices, and any combinations thereof. A network,such as network 1044, may employ a wired and/or a wireless mode ofcommunication. In general, any network topology may be used. Information(e.g., data, software 1020, etc.) may be communicated to and/or fromcomputer apparatus 1000 via network interface device 1040.

Computer apparatus 1000 may further include a video display adapter 1052for communicating a displayable image to a display device, such asdisplay device 1036. Examples of a display device include, but are notlimited to, a liquid crystal display (LCD), a cathode ray tube (CRT), aplasma display, a light emitting diode (LED) display, and anycombinations thereof. Display adapter 1052 and display device 1036 maybe utilized in combination with processor 1004 to provide graphicalrepresentations of aspects of the present disclosure. In addition to adisplay device, computer apparatus 1000 may include one or more otherperipheral output devices including, but not limited to, an audiospeaker, a printer, and any combinations thereof. Such peripheral outputdevices may be connected to bus 1012 via a peripheral interface 1056.Examples of a peripheral interface include, but are not limited to, aserial port, a USB connection, a FIREWIRE connection, a parallelconnection, and any combinations thereof.

The foregoing has been a detailed description of illustrativeembodiments of the invention. Various modifications and additions can bemade without departing from the spirit and scope of this invention.Features of each of the various embodiments described above may becombined with features of other described embodiments as appropriate inorder to provide a multiplicity of feature combinations in associatednew embodiments. Furthermore, while the foregoing describes a number ofseparate embodiments, what has been described herein is merelyillustrative of the application of the principles of the presentinvention. Additionally, although particular methods herein may beillustrated and/or described as being performed in a specific order, theordering is highly variable within ordinary skill to achieve methods andapparatuses according to the present disclosure. Accordingly, thisdescription is meant to be taken only by way of example, and not tootherwise limit the scope of this invention.

Exemplary embodiments have been disclosed above and illustrated in theaccompanying drawings. It will be understood by those skilled in the artthat various changes, omissions and additions may be made to that whichis specifically disclosed herein without departing from the spirit andscope of the present invention.

What is claimed is:
 1. An apparatus for parsing and comparing resumevideo duplications, the apparatus comprising: at least a processor; anda memory communicatively connected to the at least a processor, thememory containing instructions configuring the at least a processor to:acquire a plurality of video elements from an existing video resume,wherein the existing video resume comprises at least an image component;recognize subject-specific data of the existing video resume as afunction of the at least an image component, wherein thesubject-specific data comprises verbal content and non-verbal content;recognize at least a keyword of the existing video resume as a functionof the subject-specific data; recognize at least a feature of theexisting video resume as a function of the subject-specific data;compare the subject-specific data to a target video resume; anddetermine, as a function of the comparison result, a duplicationcoefficient for the target resume video.
 2. The apparatus of claim 1,wherein the at least a processor is further configured to recognize theat least a keyword of the existing video resume as a function of the atleast an image component using optical character recognition.
 3. Theapparatus of claim 1, wherein the at least a processor is furtherconfigured to recognize the at least a feature of the existing videoresume as a function of the verbal content of the subject-specific dataof the existing video resume.
 4. The apparatus of claim 1, wherein theat least a processor is further configured to recognize the verbalcontent of the existing video resume using an audiovisual speechrecognition process.
 5. The apparatus of claim 1, wherein the at least aprocessor is further configured to recognize the verbal content and thenon-verbal content as a function of the at least an image componentusing a machine vision process.
 6. The apparatus of claim 1, wherein theat least a processor is further configured to query a target videofeature keyword of the target video resume.
 7. The apparatus of claim 6,wherein the at least a processor is further configured to query the atleast an image component of the existing video resume as a function ofthe target video feature keyword.
 8. The apparatus of claim 1, whereinthe at least a processor is further configured to classify thenon-verbal content of the existing video resume to the target videousing a non-verbal classifier.
 9. The apparatus of claim 1, whereincomparing the subject-specific data to the target video resume comprisescomparing the subject-specific data to job descriptive data using acandidate classifier.
 10. The apparatus of claim 9, wherein the at leasta processor is further configured to rank two or more existing videoresumes based on a relevance to the target video resume using thecandidate classifier.
 11. A method for parsing and comparing resumevideo duplications, the method comprising: acquiring, using at least aprocessor, a plurality of video elements from an existing video resume,wherein the existing video resume comprises at least an image component;recognizing, using the at least a processor, subject-specific data ofthe existing video resume as a function of the at least an imagecomponent, wherein the subject-specific data comprises the verbalcontent and the non-verbal content; recognizing, using the at least aprocessor, the at least a keyword of the existing video resume as afunction of the subject-specific data; recognizing, using the at least aprocessor, the at least a feature of the existing video resume as afunction of the subject-specific data; comparing, using the at least aprocessor, the subject-specific data to a target video resume; anddetermining, using the at least a processor and as a function of thecomparison result, a duplication coefficient for the target resumevideo.
 12. The method of claim 11, further comprising: recognizing,using the at least a processor, the at least a keyword of the existingvideo resume as a function of the at least an image component usingoptical character recognition.
 13. The method of claim 11, furthercomprising: recognizing, using the at least a processor, the at least afeature of the existing video resume as a function of the verbal contentof the subject-specific data of the existing video resume.
 14. Themethod of claim 11, further comprising: recognizing, using the at leasta processor, the verbal content of the existing video resume using anaudiovisual speech recognition process.
 15. The method of claim 11,further comprising: recognizing, using the at least a processor, theverbal content and the non-verbal content as a function of the at leastan image component using a machine vision process.
 16. The method ofclaim 11, further comprising querying, using the at least a processor, atarget video feature keyword of the target video resume.
 17. The methodof claim 16, further comprising querying, using the at least aprocessor, the at least an image component of the existing video resumeas a function of the target video feature keyword.
 18. The method ofclaim 16, further comprising: classifying, using the at least aprocessor, the non-verbal content of the existing video resume to thetarget video using a non-verbal classifier.
 19. The method of claim 11,wherein comparing the subject-specific data to the target video resumecomprises comparing the subject-specific data to job descriptive datausing a candidate classifier.
 20. The method of claim 19, furthercomprising: ranking, using the at least a processor, two or moreexisting video resumes based on a relevance to the target video resumeusing the candidate classifier.